Finding Global Optimum for Truth Discovery: Entropy Based Geometric Variance

نویسندگان

  • Hu Ding
  • Jing Gao
  • Jinhui Xu
چکیده

Truth Discovery is an important problem arising in data analytics related fields such as data mining, database, and big data. It concerns about finding the most trustworthy information from a dataset acquired from a number of unreliable sources. Due to its importance, the problem has been extensively studied in recent years and a number techniques have already been proposed. However, all of them are of heuristic nature and do not have any quality guarantee. In this paper, we formulate the problem as a high dimensional geometric optimization problem, called Entropy based Geometric Variance. Relying on a number of novel geometric techniques (such as LogPartition and Modified Simplex Lemma), we further discover new insights to this problem. We show, for the first time, that the truth discovery problem can be solved with guaranteed quality of solution. Particularly, we show that it is possible to achieve a (1 + )-approximation within nearly linear time under some reasonable assumptions. We expect that our algorithm will be useful for other data related applications. 1998 ACM Subject Classification F.2.2 Nonnumerical Algorithms and Problems Geometrical problems and computations

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Method for Root Detection in Minirhizotron Images: Hypothesis Testing Based on Entropy-Based Geometric Level Set Decision

In this paper a new method is introduced for root detection in minirhizotron images for root investigation. In this method firstly a hypothesis testing framework is defined to separate roots from background and noise. Then the correct roots are extracted by using an entropy-based geometric level set decision function. Performance of the proposed method is evaluated on real captured images in tw...

متن کامل

Exploring Relevance as Truth Criterion on the Web and Classifying Claims in Belief Levels

The Web has become the most important information source for most of us. Unfortunately, there is no guarantee for the correctness of information on the Web. Moreover, different websites often provide conflicting information on a subject. Several truth discovery methods have been proposed for various scenarios, and they have been successfully applied in diverse application domains. In this paper...

متن کامل

A Level-Value Estimation Algorithm and Its Stochastic Implementation for Global Optimization

In this paper, we propose a new method for finding global optimum of continuous optimization problems, namely Level-Value Estimation algorithm(LVEM). First we define the variance function v(c) and the mean deviation function m(c) with respect to a single variable (the level value c), and both of these functions depend on the optimized function f(x). We verify these functions have some good prop...

متن کامل

An Entropy-Based Position Projection Algorithm for Motif Discovery

Motif discovery problem is crucial for understanding the structure and function of gene expression. Over the past decades, many attempts using consensus and probability training model for motif finding are successful. However, the most existing motif discovery algorithms are still time-consuming or easily trapped in a local optimum. To overcome these shortcomings, in this paper, we propose an e...

متن کامل

Real Time Object Detection and 3D Modeling Using Fuzzy Logic

This paper OD3DM (Object detection and 3D modeling) mainly discussed the process to detect complex geometric objects and thereafter performing 3D modeling of geometric objects using Entropy based selection of optimum transformation of input data, wavelet based transformation and fuzzy logictechniques for designing and training of object recognition systems using realistic 3D computer graphics m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016